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^ | Abstract 
00 

In a three-node network a half-duplex relay node enables bidirectional communication between 

CO 

f^*) ' two nodes with a spectral efficient two phase protocol. In the first phase, two nodes transmit their 

message to the relay node, which decodes the messages and broadcast a re-encoded composition in the 
C/3 • second phase. In this work we determine the capacity region of the broadcast phase. In this scenario 

each receiving node has perfect information about the message that is intended for the other node. The 



resulting set of achievable rates of the two-phase bidirectional relaying includes the region which can 
be achieved by applying XOR on the decoded messages at the relay node. We also prove the strong 
converse for the maximum error probability and show that this implies that the [e^, e 2 ]- ca P ac ity region 
defined with respect to the average error probability is constant for small values of error parameters e 1; 
£2- 

I. Introduction 

Future wireless systems should offer connectivity almost everywhere. This objective represents 
an ambitiously engineering challenge in scenarios where the direct link between two nodes 
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does not have the desired quality, e.g. due to shadowing or distance. On that score, multi-hop 
communication for coverage extension and meshed network architectures are currently discussed 
or scheduled in all wireless networks standards of the next generation. Therefore, the relay 
channel experiences a revival recently. The problem was introduced by van der Meulen in [1] 
in the early seventies. A few years later, Cover and El Gamal obtained the capacities of the 
physically degraded and reversely degraded relay channels and upper and lower bounds on the 
capacity of the general relay channel in [2]. The general problem is still unsolved. Fundamental 
insights about the general problem and recent development can be found in [3] and references 
therein. 

We consider a three-node network where one node acts as a relay to enable the bidirectional 
communication between two other nodes. The two-way communication problem without a relay 
node was introduced by Shannon in [4] in 1961 already. Therein, he obtained the capacity region 
for the average error for the restricted two-way channel, i.e. a feedback between the two nodes 
is not allowed. Nowadays, this is regarded as the first network information theory problem. 

In information theory it is often assumed that the nodes can transmit and receive at the same 
time, i.e. full-duplex nodes. This assumption is in wireless communication hard to fulfill, since 
it is practically difficult to isolate a simultaneously received and transmitted signal using the 
same frequency sufficiently. Therefore, in this work we assume half-duplex nodes. As a natural 
consequence of this assumption is that relay communication is performed in phases. Often the 
relay communication should be integrated in existing infrastructures and most protocol proposals 
base usually on orthogonal components which require exclusive resources for each link. As a 
consequence they suffer from an inherent loss in spectral efficiency. This loss can be significantly 
reduced if bidirectional relay communication is desired. Because then the communication can 
be efficiently performed in two phases. In the first phase, the multiple access phase (MAC), the 
information is transmitted to the relay node. In the succeeding broadcast phase (BC), the relay 
node forwards the information to its destinations. In [5] and [6], where Gaussian channels are 
considered, the relay performs superposition encoding in the second phase. The knowledge of 
the first phase allows the receiving nodes to perform interference cancellation before decoding so 
that effectively we achieve interference-free transmission in the second phase. Another interesting 
approach [7], [8] is based on the network coding principle [9], [10] where the relay node performs 
an XOR operation on the decoded bit streams. But since network coding is originally a multi- 
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terminal source coding problem, such an approach operates on the decoded data and therefore 
does not deal with channel coding aspects. 

Because of our practical motivation, we apply time-division to separate the bidirectional relay 
communication into two phases. The optimal coding strategy and capacity region of the general 
multiple access channel is known. In this work, we present the optimal broadcast coding strategy 
of the two-phase bidirectional relay channel based on classical channel coding. It shows that all 
rate pairs in the capacity region can be achieved using an auxiliary random variable taking two 
values, i.e. we achieve the capacity region by the principle of time-sharing. Thereby, we see an 
interesting connection to a joint source and channel coding approach for the broadcast channel 
based on Slepian-Wolf coding [11]. 

In a multi-terminal system the average and maximal error capacity region can be different, even 
in the case of asymptotically vanishing errors as is shown by Dueck in [12]. While for single-user 
channels it is of no importance whether we use vanishing average or maximal probabilities of 
error in the definition of achievable rates, the choice of the error criterion makes a big difference 
if we pass to the consideration of the strong converses for one-way channels. Indeed Ahlswede 
demonstrated in [13] that the strong converse does not hold for the compound channels if we use 
the average probability of error for the definition of ^-achievable rates but it is well known that 
the strong converse is valid if we use maximal error probabilities as was shown by Wolfowitz 
[14]. For these reasons, we will pay a lot of attention to the consideration of the maximal and 
average error probabilities and the relation between them in the main part of the paper and in 
the proofs. 

The paper is organized as follows: In the following two subsections we present the two-phase 
bidirectional relay model, which describes the context of the bidirectional broadcast channel and 
after that we briefly restate the MAC capacity region for completeness. In Section UU we prove 
a coding theorem and a weak converse for the maximum error probability. The proof shows 
that the capacity region is independent of whether we use asymptotically vanishing average or 
maximum probability of error. In Section [III] we prove the strong converse for the maximum error 
probability using the Blowing-up Lemma [15]. Finally, from this we can deduce that the [e±, 
capacity region in terms of average probability of error is constant for all [e-y, e 2 ] G (0, |) x (0, |) 
or G (0, |) x (0, \) and equals the e 2 ]-capacity region defined with respect to maximum error 
probability in that range of values of [£1,62]- Based on the capacity regions of the two phases 



4 




Fig. 1. Multiple access (MAC) and broadcast (BC) phase of the time division bidirectional relay channel. 



the time-division between MAC and BC phase can be optimized. This gives us the largest 
achievable rate region for the finite alphabet discrete memoryless bidirectional relay channel 
under the simplification of time-division into two phases, which will be discussed in Section iTVl 
by means of a binary channel example. 

A. Two Phase Bidirectional Relay Channel 

We consider a three-node network with two message sets W\ and W2. In our bidirectional 
channel we want the messages w± E Wi located at node 1 and the message W2 £ W2 located 
at node 2 to be known at node 2 and node 1, respectively. We assume that there is no direct 
channel between node 1 and 2. Therefore, node 1 and 2 need the support of a relay node R. 

We simplify the problem by assuming an a priori separation of the communication into two 
phases. Furthermore, we do not allow cooperation between the encoders at node 1 and node 2. 
Otherwise, a transmitted symbol could depend on previously received symbols. For a two-way 
channel this is known as a restricted two-way channel. With this simplification we end up with 
a multiple access phase, where node 1 and 2 transmit messages ivi and w 2 to the relay node, 
and a broadcast phase, where the relay forwards the messages to node 2 and 1, respectively. We 
look at the two phases separately. After that we will briefly consider the optimal time-division 
between the two phases. 

In the multiple access phase (MAC) we have a classical multiple access channel, where the 
optimal coding strategy and capacity region Cmac is known [16], [17]. We will restate the 
capacity region in the next subsection. Thereby, let and R^> denote the achievable rates 
between node 1 and 2 and the relay node in the MAC phase. 

For the broadcast phase (BC), we assume that the relay node has successfully decoded the 
messages w± and w 2 in the multiple access phase. From the union bound we know that the error 
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probability of the two-phase protocol is at most the sum of the error probability of each phase. 
Therefore, an error-free MAC phase is reasonable if we assume rates within the MAC capacity 
region and a sufficient coding length. From this we have a broadcast channel where the message 
Wi is known at node 1 and the relay node and the message w 2 is known at node 2 and the relay 
node, as depicted in Figure [U Thereby, let x\, x 2 and x denote the input and y x , y 2 and y the 
output symbols of node 1, node 2, and the relay node, respectively. Furthermore, let R^+ and 
denote the achievable rates between the relay node and node 1 and 2 in the BC phase. 
The mission of the relay node is to broadcast a message to node 1 and 2 which allows them 
to recover the unknown source. This means that node 1 wants to recover message w 2 and node 2 
wants to recover message W\. We will present an information theoretic optimal coding strategy 
and the capacity region of the bidirectional broadcast channel in Section UH 

B. Capacity Region of Multiple Access Phase 

In this subsection, we restate the capacity region of the multiple access channel, which was 
found by Ahlswede [16] and Liao [17] and is part of any textbook on multiuser information 
theory, e.g. [18]. 

Definition 1.1: A discrete memoryless multiple access channel is the family {p^ : X[ l x 
X£ — > y n }neN with finite input alphabets X k , k = 1,2, and the finite output alphabet y where 
the probability transition functions are given by p^ n \y n \xi, x 2 ) ■= Yfi=i P{yi\ x iXi x i2) f° r a given 
probability transition function {p{y\x 1 ,x 2 )}x 1 ex 1) x 2 ex 2 ,yey- 

Theorem 1.2: The capacity region Cmac of the memoryless multiple access channel is the set 
of all rate pairs [R^s, satisfying 

R u <I(Xr,Y\X 2 ,U), 
R^ < I(X 2 ;Y\X h U), and 
^ + i? 2 - S </(X 1 ,X 2 ;y|C/), 

for random variables [U,X 1 ,X 2 ,Y] with values in U x X 1 x X 2 x y and joint distribution 

{q(u)q 1 (x 1 \u)q 2 (x 2 \u)p(y\x 1 ,x 2 )}ueu,xiex 1 ,x2eX2 > yey- Furthermore, the range U of the auxiliary 
random variable U has a cardinality bounded by \U\ < 2. 



6 



II. Capacity Region of Broadcast Phase 

In this section we present our main result, the capacity region of a broadcast channel where 
the receiving nodes have perfect knowledge about the message which should be transmitted to 
the other node. The capacity region can be achieved by classical channel coding principles. First 
we need to introduce some standard notation. 

Definition 2.1: Let X and y^, k = 1, 2, be finite sets. A discrete memoryless broadcast channel 
is defined by a family {p^ : X n — > y™ x 3^2/neN of probability transition functions given by 
p( n \y™,y™\x n ) := Y^ =l p{yii-,yi2\xi) for a probability transition function p : X — > x 3^2> i-e. 
{p(!/i>J/2|z)}a!6* lW 6yi,OTeMi is a stochastic matrix. 

In what follows we will suppress the super-index n in the definition of the n-th extension of 
the channel p, i.e. we will write simply p instead of p^ n \ This should cause no confusion since 
it will be always clear from the context which block length is under consideration. In addition, 
we will use the abbreviation V := Wi x W 2 , where Wi and W 2 denote the message sets. 

Definition 2.2: A (M[ n \ M 2 , n)-code for the bidirectional broadcast channel consists of one 
encoder at the relay node 

x n : V -> X n , 

and a decoder at node one and two 

91 ■■ y r i xW^W 2 U {0}, 

92 ■ y r 2 x W 2 -> Wi U {0}. 

The element in the definition of the decoders is included for convenience only and plays the 
role of an erasure symbol. 

When the relay node sends the message v = [wi,W2], the receiver of node one is in error if 
gi(Y™,wi) 7^ w 2 . The probability of this event is denoted by 

Xi(v) := P[g'i(Y 1 n , wx) ^ w 2 I x n (v) has been sent]. 

Accordingly, we denote the probability that the receiver of node two is in error by 

\ 2 (v) := P[g 2 (Y 2 n , w 2 ) ^ Wi \ x n (v) has been sent]. 

Hereby, Y™ and Y 2 n denote the random outputs at nodes 1 and 2 given that the sequence x n (v ) 
has been sent down the channel. This allows us to introduce the notation for the maximum and 



7 



average probability of error for the A;-th node 

A l n) : = maxA fc (u), := ^E A fcO)• 

Definition 2.3: A rate pair [R^, R^\ is said to be achievable for the bidirectional broadcast 
channel if for any 5 > there is an n(5) G N and a sequence of (M[ n \ M 2 (n) , n)-codes such that 
for all n > n(<5) we have }2EM£!. >R^- 5 and > _ s while A^ n) , A^ n) -> when 

n — > oo. The set of all achievable rate pairs is the capacity region of the bidirectional broadcast 
channel and is denoted by C B o 

Remark 2.4: Achievable rate pairs and a capacity region can be also defined for average 
probability of error. 

Theorem 2.5: The capacity region Cbc of the bidirectional memoryless broadcast channel is 
the set of all rate pairs [R^z, R^] satisfying 

%></(X;Y 2 |C/), 

(1) 

R^<I{X-Y X \U), 

for random variables [U, X, Yi, Y 2 ] with values in U x X x y x x y 2 and joint probability distribution 
{qi(u)q2(x\u)p(y 1 ,y 2 \x)} ueU>x ex ! y 1 ey im ey 2 . The cardinality of the range of U can be bounded 
by \U\ < 2. 

The theorem is proved in the following three subsections. In the first subsection we prove the 
achievability, i.e. a coding theorem. We prove a weak converse with respect to the maximum 
probability of error in the second subsection. Then the theorem is proved with the third subsection 
where we show that a cardinality of two is enough for the range of the auxiliary random variable. 

A. Proof of Achievability 

Here, we adapt the random coding proof for the degraded broadcast channel of [19] to our 
context. First, we prove the achievability of all rate pairs [R^, R^\ satisfying 

R^ 2 <I(X;Y 2 ), R^<I(X;Yi), (2) 

for some probability function p(x)p(yi,y 2 \x). Then, we extend this to prove that all points in 
the closure of the convex hull of © are achievable, which we will see is exactly the region 
stated in Theorem 12.51 
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1 ) Random codebook generation: We generate M% independent codewords X n (v), 
v = [wi,w 2 ] of length n with := 2 lnR ^ and M 2 (n) := 2 [nR ^ according to U7=iP( x i)- 

2) Encoding: To send the pair v = [wi,W2] with Wk E V\4, k = 1,2, the relay sends the 
corresponding codeword x n (v). 

3) Decoding: The receiving nodes use typical set decoding. First, we characterize the 
decoding sets. For the decoder at node k — 1, 2 let 

n 2 p(y n k ) 

with average mutual information I(X;Yk) := E x n i3/ n[/(x n ; y k )}. This gives the decoding set 

S(y n k ) :={x n eX n :I(x n ;y n k )> 



R^+I(X;Y k ) ' 



and indicator function 

1, if x n £S{yl) 



d{x\y n k , 



0, otherwise. 



When x n (v) with v = [wi,w 2 ] has been sent, and and y% have been received we say that 
the decoder at node k makes an error if either x n (v) is not in S(y k ) (occurring with probability 
Pe k (v)) or if at node one x n (wi,w 2 ) with w 2 ^ w 2 is in <S(y") or at node two x n (wi,w 2 ) 
with w\ ^ wi is in S(y r 2 l ) (occurring with Pf k {v)). If there is no or more than one codeword 
x n (wi, ■) E S(y[ l ) or x n (-,w 2 ) G S(yV; ), the decoders map on the erasure symbol 0. 

4) Analysis of the probability of error: From the union bound we have X k (v) < P^ k \v) + 



P®{v) with 



and 



P^(v):= E P{yl\x n (v))d{x n {v),yl) forA; = l,2 



p${v)'= E pG/IVM) E (i-^-k,^),^)), 

|Wi| 



i«i=i 



For uniformly distributed messages Wi and W 2 we define Pjg' := |^jjy^y E„ e wixw 2 P ij 
for m = 1,2 so that //j^ < Pj^ + Pfl- Next, we average over all codebooks, i.e. Ez«[/-4 ] — 



V 
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In the following, we show that if R^j> < I(X, Y k ) — 2e for any s > 0, we have E^l/tfe] — > 
when n — > oo. We have 

E jpW] — i — y E xn \P {1 hv)] 



e.fcJ ~~ 

for any 
fixed v 



dGWixW 2 



E EAp(y n k \x n (v))d(x n (v),y n k )} 
E E p(^)p(tfl*V(* B ,V*) 



=E B » ll( j[d(a» I y3f)]=P[d(x» ) i/J) = l] 
=P ' 



J(x";yg)< ^fflg» 



<P[J(x n ;^)</(X;y fc )-e] ^0 

n— »oo 

exponentially fast by the law of large numbers. For the calculation of E x n[P e j)] we have to 
distinguish between the receiving nodes. We present the analysis for k = 1, the case k = 2 
follows accordingly. Thereby, we use the fact that for v = [101,102] ^ [101,102] the random 
variables p(y™\X n (v)) and d(X n (wx, io 2 ), y") are independent for each choice of y± G y™. 



for any 
fixed 



1 111 Z| DeWixW 2 
|W 2 | 

EEjp(^V(tO)E (l-dC^.t&a),^)) 



U)2=l 



|W 2 | 



IW2I 



yfey? j&2=i 



\m\ 



E E p(vr) E p(s B )(i-d(* n ,i/D) 
— 1) e E ^w)- 
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Whenever x n G we have I(x n ;y?) = Mog 2 > \{R^ X + /(X;^)) or p(y?) < 

p(^|x n )2- n(fl m +/(X;yi))/2 . Consequently, 

E^fPiJ] < |W 2 | X] ^ p(x n )p( 2 /rk re )2"^ ( ^ +7(X;ri)) 

< 2 nR ri2~ n ^ R m+h I (X;Yi)) = 2 n(|ii sl -i/(X;r 1 )) < 2 -n £ > Q 

n— »oo 

Hence, if i?^ < Yj.), fc = 1,2, the average probability of error, averaged over codebooks 
and codewords, gets arbitrary small for sufficiently large block length n. 

5) Code Construction with arbitrary small maximum probability of error: If R^* < I(X; Y\) 
and R^* < I(X; Y 2 ) we can choose e > and n G N so that we have [/4 + /i4 ] < £■ 
Since the average probabilities of error over the codebooks is small, there exists at least one 
codebook C* with a small average probabilities of error fi^ 1 ' + /i^ < s. This implies that we 
have /i^ < e and ^ < e. We define sets 

Q ■= { v g V : AiO) < 8e and X 2 (v) < 8e} 
TZ k := {v G V : A fc 0) > 8^} for A; = 1,2. 

Since £ > hjt X^ev ^fe(^) — w'^e, we can bound the cardinality |72.fc| < ^ for k — 1, 2. Then 
from V = Qun 1 UTZ 2 it follows 

|Q| > |v| - 1^1 - \n 2 \ > ||v|. 

Now, let T be the set of w\ having the property that for each w\ there are at least \M 2 
choices of w 2 so that [ti^it^] G Q. Therefore, for W\ G T there are at most M 2 choices 
w 2 G W 2 and for w x ^ T there are less than |m|"' choices u; 2 G W2 such that [wi, w 2 ] G Q. 
Accordingly, we have 

\T\M { 2 n) + |Wi \ T|±M 2 (n) > |Q| > fM^M^ 

so that it follows that \T\ > |M 1 (n) using |Wi \ T\ = M 1 (n) - \T\. This means that there exists 
an index set Q* C Wi with indices Wi, to each of which we can find an index set 

Q 2 {wi) C W 2 with lM 2 n ^ indices u; 2 so that we have for each w\ G Q\ and w; 2 G Q 2 (wi) a 
maximum error Afc(wi,w 2 ) < 8e, k = 1,2. 

It follows that there exist one-to-one mappings $ : V* — > Q*, $1 : W* — > Q*, : W 2 — > 
Q 2 Oi) for each w x G Q* with $(wi,w 2 ) := [$iOi), $ 2 Ul (w 2 )] with sets V* := W* x >V 2 , 
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Wt := {l,2...,|Mf ) } for k = 1,2, Q* := {K,w 2 ] G V : Wi G Q?, w 2 G Q£(wi)} C Q. 
Accordingly, there exist mappings ^ fc : Q* —> W£, fc = 1,2, with v = [$ 1 (®(v)),ty 2 (®(v))]- 

This allows us finally to define a (|Mj , |M 2 n \ n)-code with an encoder x n : V* — > A?" 
with £ n 0) := x n ($(*;)) and decoders g x : x W* -> W 2 * and # 2 : x W 2 * -> }%* 
with <7i(j/i»wi) := ^ 2 (w 1 ,gi(yi,wi)) and g 2 (y 2 ,w 2 ) ■= ViMy* , w 2 ), w 2 ) where we use the 
mappings ^> k : V — ► W£ given by 

* fc (v), if u G Q* 
0, if u £ Q* 

for = 1,2. The idea is that the encoder uses only codewords x n (v) of the code C* with an 
index v G Q*, which have a maximum error Xk(v) < 8e, k = 1,2. Since the decoders use the 
typical set decoder of the code C*, they could erroneously find an x n (v ) with v G V \ <2*. In 
this case, the mapping Sfr k decides on the erasure symbol 0. It was already a wrong decision by 
the decoder g k , since the encoder chooses only codewords x n (v) with v G Q*. Therefore, this 
does not add any error to the decoding. The code has a rate pair [[n(R^ — ^)J , [n(R^ — i)J], 
which can be made arbitrary close to [R#$, -R^] when n — > oo. This proves the achievability of 
any rate pair satisfying the equation ©. 

<5) Convex hull: Let TZ(p(x)) denote the set of rates which we can achieve with the input 
distribution p(x). Since the cardinality of the input set X is finite, the rate region U P (^) ^(p( x )) 
is bounded. 

For k — 1, 2, we can rewrite the right hand side of CQ) as follows 

\u\ \u\ 
I{X-Y k \U) = J2p(u)I(X;Y k \U = u) = Ep(«)^5 | p(s | u) , 

where in /(X; Y^) | , , we choose a specific input distribution p(x\u) according to the auxiliary 
random variable U . For the input distribution p{x\u) we know from the first part of the proof 
that any rate pair R u G lZ(p(x\u)) C R 2 is achievable. Therefore, for any convex combination 
J2™=i a uRu we can regard the weights as probability mass function with p(u) := a u and 
u eU := {1,2,..., m} and choose for any u an input distribution p(x\u) that achieves the rate 
pair R u . For that reason, the conditional mutual informations given by the right hand sides of 
© are also achievable rates. □ 
The coding theorem usually offers a hint how to design a good channel code practically. 
Accordingly, in [20] an interesting coset coding strategy for symmetric channels is discussed. 
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In general in multi-terminal system the average and maximal error capacity region can be 
different. Ahlswede has shown for the two-way channel in [21] that "one cannot reduce a code 
with average errors to a code with maximal errors without an essential loss in code length 
or error probability, whereas for one-way channels it is unessential whether one uses average 
or maximal errors." The problem in the two-way channel is to find a maximal error sub-code 
with a Cartesian product structure. This problem is equivalent to a combinatorial problem by 
Zarankiewicz and arises since the transmitter and receiver have partial knowledge only. Here, the 
relay node has full knowledge so that for the code construction with arbitrarily small maximum 
probability of error we need not require a sub-code with Cartesian product structure. 

In the next subsection we prove the weak converse for the maximal error. Since the Fano's 
inequalities apply for the average error as well, the weak converse for the average error follows 
analogously. 

B. Proof of weak converse 

We have to show that any given sequence of (M[ n \ , n) -codes with A^ , A^ — > 
must satisfy JlogAff" 5 < I(X;Y 2 \U) + o(n°) and ±logM 2 (n) < I{X\Y X \U) + o(n°) for a 
joint probability distribution q\{u)q 2 {x\u) p(yi, y 2 \x). For a fixed block length n we define the 
joint probability distribution p(w u w 2 , x n , y?, y%) := fp^jp-j q 2 {x n \w x , w 2 ) Yli =1 p(yu, V2i\xi) 
on Wi x W 2 x X n x y™ x y% where the conditional distribution q 2 (x n \wi, w 2 ) = 1 if x n is the 
codeword corresponding to W\, w 2 or is equal to else. In what follows we consider for k — 1, 2 
uniformly distributed random variables Wk with values in W fc . 

Lemma 2.6: For our context we have the Fano's inequality 

H(W 2 \Y?, Wi) < Aj n) log \W 2 \ + 1 = nef\ (3) 

with 4 n) = !2sO^ A S n) + i -> for n -> oo as A^ n) -> 0. 

Proof: From Y"™ and W\ node 1 estimates the index W 2 from the sent codeword X n (W%, W 2 ). 
We define the event of an error at node 1 as 

|l, ifyi^Wi^Wa, 

|o, if yi (i7,wi)=w 2 , 
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so that we have for the mean probability of error /x^ = P[Ei — 1] < Af 1 . From the chain rule 
for entropies we have 

H(Et, W 2 \Yf, Wx) = H(W 2 \Y?, Wi) + H^Y?, W h W 2 ) 
= H(Ex\Y?, Wi) + H{W 2 \E, Y™, Wt) 

Since E\ is a function of W±, W 2 and Y™, we have H(E 1 \Y{ 1 , Wi, W 2 ) = 0. Further, since E\ 
is a binary-valued random variable, we get H(E 1 \Y{ 1 , W\) < H(Ei) < 1. So that finally with 
the next inequality 

H(W 2 \Y?, Wt, E x ) = PIE, = ti\H{W 2 \Y?, W u E x = 0) + = 1]H{W 2 \Y?, W x , E x = 1) 
< (1 - /iS n) )0 + /4 n) log(|W 2 | - 1) < A< n) log \W 2 \ 

we get Fano's inequality for our context. □ 
Therewith, we can bound the entropy H (W 2 ) as follows 

h{w 2 ) = h{w 2 \w x ) = i(w 2l r^iwo + H(W 2 \Y?, Wi) 

< I(W 2 ; Y?\Wi) + ne^ < I(W U W 2 , Y?) + ne { f ] 

< I(X n ; Y?) + ne^ < H(Y?) - H(Y?\X n ) + ne^ ] 

where the equations and inequalities follow from the independence of W\ and W 2 , the definition 
of mutual information, Lemma 1, the chain rule for mutual information, the positivity of mutual 
information, and the data processing inequality. If we divide the inequality by n we get the rate 

n . 

iH(W 2 ) < ^(HiY^Yt 1 ) - H(Y li \Yr\X n ))+e^ 

n n 

< ^(H(Y ll )-H(Y lt \X l ))+e^ = ±£l(y M ;X0 + 
i=i t=i 

using the memoryless property and again standard arguments. A similar derivation for the source 

rate \H (Wi) gives us the bound ±tf (Wi) < ^1^; Xj+e^ with e 2 n) = I^aM + I - 

i=i 

for n — > oo as \% — ► 0. 

This means that the entropies H(Wi) and H(W 2 ) are bounded by averages of the mutual infor- 
mations calculated at the empirical distribution in column i of the codebook. Therefore, we can 
rewrite these inequalities with an auxiliary random variable U, where U = i E U = {1,2, . . . ,n} 
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with probability -. We finish the proof of the converse with the following inequalities 

»)<iOTH;X 4 )+4 n) 
i=i 

n 

= Y i P(U = i)I(y li ;X i \U = i)+e™ 
i=i 

= I{Y W - X V \U) + 4 n) = I(Y i; X\U) + 4 n) 

and £if (Wi) < J(y 2 ; X|[/)+4 n) accordingly where £^ n) -»• 0, fc = 1, 2, when n — > oo. Thereby, 
Ffe := Yfe(7 and X := Xy are new random variables whose distribution depend on U in the same 
way as the distributions of Y ki and Xj depend on i. □ 
Up to now the auxiliary random variable U is defined on a set U with arbitrary cardinality. 
Next, we will show that \U\ — 2 is enough. 

C. Cardinality of set U 

With Fenchel-Bunt's extension of Caratheodory's theorem it follows that any rate pair in 
ConvexHull( UpfaO ^-(p( x ))) = Uu&t ^ip( x \ u )) ^ s achievable by time-sharing between two rate 
pairs from U P (x) i- e - 1^1 = 2 is enough. 

Theorem 2.7 ([22, Theorem 1.3.7]): If S C R n has no more than n connected components 
(in particular, if S is connected), then any x E ConvexHull(5) can be expressed as a convex 
combination of n elements of S. 

Since for any x G X we have [0, 0] e lZ(p(x)), the set \J p ^ lZ(p(x)) is connected. Therefore, 
any rate pair in C B c = ConvexHull( \J p r x \ TZ(p(x))') can be expressed as a convex combination 
of n = 2 rate pairs of [j p M TZ(p(x)). □ 

This finishes the proof of the capacity region of the bidirectional broadcast channel. 

Remark 2.8: Since the coding theorem includes the achievability of rate pairs in terms of the 
average probability of error and the proof of the weak converse for the average error works 
analogously, C B c is a l so me capacity region in terms of average probability of error. 

Remark 2.9: The characterization of the bidirectional broadcast capacity region for Gaussian 
channels is analogous. We would have to deal with discrete channels with Gaussian channel 
transfer distributions and would have to add an input power constraints but the arguments are 
similar to the arguments considered here. 
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In the next section we present the strong converse in the case of maximum probability error. 
Therefore, we will refine the achievability definition to [ex, £2] -achievable rate pairs. Then it 
follows from the strong converse for the maximum probability of error that the [ex, £2] -capacity 
region is equal C B c- Finally, from this we can deduce on the [ex, £2] -capacity region in terms of 
average probability of error for sufficiently small average error. 

III. Sharper Versions of the Converse Part for the Broadcast Phase 

Here, we derive a sharper converse to the coding theorem for the bidirectional broadcast 
channel. We prove the full strong converse for the capacity region defined with respect to the 
maximum error probability, i.e. Cecmax^i, £ z) = ^-bc for all £i,£2 G (0,1). Additionally, we 
show that the [ex, £2] -capacity region C^q^Ex, e 2 ) defined by using average error probability 
coincides with Cbc f° r small values of £i,£2 G (0, 1). 

The main tool we will use is the powerful blowing-up technique introduced by Ahlswede, 
Gacs and Korner in [15] based on the Blowing-up Lemma (cf. Marton's paper [23] for a simpler 
information-theoretic proof). The basic idea developed in [15] is that blowing-up the decoding 
sets in conjunction with a variant of Fano's inequality allows us to convert the weak converse 
into the strong converse to the coding theorem. 

Before entering the proof we recall the essential blowing-up notations and results which 
we need in the sequel: For a finite set y, n,l G N and B C y n we define the Hamming 
l-neighborhood by 

T l B:={yey n :d H (y,B)<l}, 

where dn denotes the non-normalized Hamming metric and dn{y',B) := min y / eB d H (y, y'). 
Theorem 3.1 (Ahlswede/Gdcs/Kdrner [15], cf. also [18], [23]): Let X and y be finite sets. 

1) For any sequence of positive integers {l n }nm with lim n ^ 00 = there exists a sequence 
{5 n (ln, I^DKeN with lim^^ S n (l n , \y\) = such that for any B C y n 

\v ln B\ < | J g|2 n<5 ' i ^ n, i^i) 

2) (Blowing-Up Lemma) To any sequence {r] n } neN with lim n ^oo?7 n = there exist a se- 
quence of positive integers {/„} ng N with lim^^ ^ = and a sequence {£ n } n eN with 
lim n ^ 00 e n = such that for every probability transition function p : X — >• y and every 
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n G N, x G X n , B C y n 

p {n \B\x) > 2- nr ' n implies p {n) iT ln B\x) >l-e n , 

where p^ : X n —> y n denotes the n-th memoryless extension of p. 

Remark 3.2: The second part of Theorem l3.1l is the uniform version of the Blowing-up lemma 
according to Csiszar/Korner [18] chap. 1.5. 

Since blowing up is an operation on the subsets of the output alphabet it is convenient to 
describe the decoding functions g x and g 2 by decoding sets. This equivalent description is 
obtained as follows; for each fixed w\ G Wi the map gx(-,Wi) : y™ — > W 2 U {0} induces 
a partition Vwi ■= {A^(^i)}™ 2 ew 2 u{c)} of y™. In a similar fashion for each w 2 G W 2 we 
obtain, using the decoder g 2 , a partition Q$ := {Bw± (w 2 )} Wl eWii){o} of the output set y%. 
Now if we are given the corresponding encoder x n : Wi x W 2 — > X n , the probabilities of error 
can be expressed by 



In what follows \ k , k = 1,2, denotes the maximum probability of error for a given code. 

A pair of non-negative reals [R^ is said to be [si, e 2 ] -achievable, ei,£ 2 G (0, 1), if for 
each 5 > there is a sequence of (M[ n \ M 2 n \ n) -codes such that for all sufficiently large n the 
following statements are fulfilled 

1 ) J log M[ n) >R^ 2 -S and ± log M 2 (n) > - 5. 

2) A[ n) < e k for k = 1,2. 

The set of all e 2 ] -achievable rates with respect to the maximum probability of error is denoted 
by C B c,max(£i,£2)- It is clear that C BC Q C BC ,max(£i, £2) and 



ei,e 2 e(0,l) 

hold. The content of the strong converse is that C B c cannot be a proper subset of CBC,max(£i> 
for ei,e 2 G (0, 1): 

Theorem 3.3: For memoryless bidirectional broadcast channel we have 



AiOi,w 2 ) =p((^K)) c |x"( Wl ,«; 2 )), 



and 



A 2 K, w 2 ) = p{{B'^\w 2 )r\x n { Wl , w 2 )). 





CbC — CBC,max(£ : l, £2) 
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for all ei,e 2 G (0, 1). 

Proof: Let [-R^rj, be an [ei, e 2 ] -achievable rate pair, thus, by definition, for any 6 > 
we can find a sequence of (M[ n \ M 2 , n) -codes and n(5) G N such that for all n > n(<5) 
following conditions are satisfied: 

1) i log M x (n) > i? r2 - 5 and i log M 2 (n) > - 5. 

2) A[ n) < e fc for k — 1,2. 

For those n we consider the families of partitions associated with the decoder maps, i.e. for 
each wi G Wi we have a partition Vw{ = {A^(wi)}«, 2e w 2 u{o} of an d analogously for 
each w 2 G W 2 a partition Q^} = {Swi\w 2 )} Wl eWiU{o} of suc h that for all W\ G Wi and 
u> 2 G W 2 we have 

P^WI 1 "^'^)) > 1 - ei > 2^", 

and 

p(B>)(^ 2 )|x"K,«; 2 )) > 1 -e 2 > 2~ nr >" 

where r) n : = - max{— log(l — s±), — log(l —£2)}- According to the second part of Theorem 13. II 
we can find a sequence of positive integers {/ n }neN with lim^oo — = such that for the sets 

A W2 ( Wl ) := r l ^f{ Wl ) and B Wl (w 2 ) := T l ^\w 2 ), 

we have 

p(A i0a (w 1 )\x n (w 1 ,w 2 )) >l-e n , (4) 

and 

p(B Wl (w 2 )\x n (w 1 , W 2 )) >l-£ n 

with limn^ooEn = 0. The sets {A W2 (wi)} W2& w 2 are not necessarily disjoint for different values 
of w 2 . The same applies to the sets {B Wl (w 2 )} Wl <z- Wl . Nevertheless, we show now that for any 
given w\ G Wi each G y™ ^ s contained in at most sub-exponentially many A W2 (wi). To this 
end, for any given y\ G y™ and w\ G Wi we define the set 

Oi(y£,Wi) := {w 2 G W 2 : G ^(ifli)}, 

and claim that 

\O l {y n 1) w l )\<2 n& ^\ y \\ (5) 
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with lim, woo 5 n (l n , \y\) — holds. The proof is given in [15]. We reproduce the full argument 
for convenience. It is obvious that w 2 G Oi(y",u>i) if and only if Awt' '(wi) H r ?n {?/"} 7^ 0- 
Therefore, since the sets {Aw™ (^i)}a) 2 ew 2 are disjoint, we have 

|Oi(yi,u>i)| < |r'"K}| < 2 rt<5 " ( ^- |3;|) 
with linin^oo 5 n (l n , \y\) = by the first part of Theorem 13.11 A similar result holds for the 
analogously defined set 02(2/2 i w 2}- 

Let us consider two independent, uniformly distributed random variables W\ and W 2 taking 
values in the sets Wi and W2 and a random variable X n with values in X n such that 

= x n (w u w 2 )\W 1 = w[, W 2 = w' 2 ) = S (wiim)i{w[><) . (6) 
Then the probability distribution of the whole system is given by 

p(w h w 2 , x n {w[, w' 2 ),y n , y 2 ) = M w M ( n) S(w 1 ,v»),(w r 1 ,w' a ) p{Vi, y 2 \x n {w[, w' 2 )) (7) 

for w k , w' k G W k , x n (w' 1 , w' 2 ) G X n , y% G ^ and k = 1, 2. 
Furthermore, for given ?/" G ^V" and u>i G Wi let us define 

e{y n ,w x ) := P(W 2 t 1 (y^w 1 )\Y 1 n = y n , ^ = (8) 

As in the proof of the weak converse one can show that 

log M 2 (n) = H(W 2 ) < I(X n ; Y?) + H(W 2 \Y?, (9) 

holds. Now, we need a variant of Fano's inequality which incorporates the quantity defined in 
®. Therefore, we use the following elementary entropy inequality: For a probability distribution 
p on a finite set A and an arbitrary B C A we have 

- J^pO) logpO) < -p(B) log p(B) + p{B)log\B\. (10) 
Then for given ?/! and w\ we set 

#(W 2 ) tf!>1 := H(W 2 \Y? = y?, m = 10O 

and obtain 

^(W / " 2 )^,«, 1 = - p(w2|j/i,wi)logj?(«;2|j/r,wi) 

X] p(^2br> w i) io gp( w 2br> w i) 

< H(e(y n llWl )) +e(ylw 1 )logMt ) + {1 - e{y n ,w x ))n5 n , 
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where we have applied eq. (flOl) to each sum and then used eq. © with the abbreviation 

8 n — $n(lm \y\)- H(e(y™,wi)) denotes the entropy of the distribution w{), 1 — e(y™, wi)). 

Averaging with respect to P(F 1 n = yi,W\ = w±) and using the concavity of the entropy we 
arrive at 

H(W 2 \Y 1 n ,W 1 ) < H(r n ) + Tn \ogM^ + (l-r n )nd n , (12) 

with r n := Ejtteivr.wiewi P ( y " = S/i > Wi = ^i)e(j/i > Note that by ®, our definition of 
X n in eq. © and © we have 

7"n = E E P( w ii w 2,yi) 

= E E t^^^i*>^)) 

= E w(n) w (n) P((^( W 0) C |g W (^l> W 2)) 
101,102 1 2 

< e„, (13) 

where the third equality holds since w 2 Oi(y™,Wi) iff y[ l ^ A W2 (wi) and the last inequality 
is by eq. ©. Thus ©, (O and ([13]) show that 

- io g M 2 ( " } < ^ J ( x "; *?) + ( 14 ) 



Similar reasoning shows that 



- log M[ n) < -I(X n ; Y 2 n ) + o(n°) (15) 
n n 



also holds. It is obvious that as in the proof of the weak converse the mutual informations on the 
right hand sides of (fl4l) and (fT5l) can be written as I(X; Yi\U n ) and I(X; Y 2 \U n ) for a suitable 
random variable U n taking values in {1, . . . ,n}. Note that by the proof of the coding theorem 
with the weak converse the rates I(X;Yi\U n ) and I(X;Y 2 \U n ) are achievable. Thus, we can 
conclude our proof by noting that for sufficiently large n we have 

flgt - * < - log M 2 W < - I(X n ; Y?) + o(n°) , 

and 

fl^ - 5 < - \ogM[ n) < -I(X n ; Y 2 n ) + o(n°), 
and that Cbc is closed. This shows that Cecmax^i, e z) C Cbc and we are done. □ 
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We give now the partial extension of Theorem 13.31 to the capacity region Cbc,sv{ £ i, £ 2) which is 
defined similarly to CBC,max(£i, £2) the difference being only that we use the average probability 
of error. Our strategy will be to reduce the statement to the Theorem 13.31 for sufficiently small 

ei,e 2 G (0,1). 

Corollary 3.4: For memoryless bidirectional broadcast channel it holds that 

for all £1 G (0, \) and e 2 G (0, \) or e x G (0, ~) and e 2 G (0, ~). 

Proof: Let [R^, R^*\ G C B c,av(^i, £2) with £x G (0, §) and e 2 G (0, |). Thus, for each 
5 > there is a sequence of (M-[ , n) -codes and n{5) G N with 

1) \ log M x (n) > R^ 2 - 5 and ± log M 2 (n) > - 5. 

2) /ii n) <£ fc for k = 1,2, 

for all n > n(5) where jj/ k denotes the average error probability. It is clear that < 1 iff 
e G (0, |) and 3^ < 1 iff e G (0, |). Therefore, we can choose real numbers a\ G (0,1) 
with j^- < a-i < 1 and a 2 G (0, 1) with 3yz^- < a 2 < 1. Let us consider the reals fk(£k) '■ = 
e k + a k (l - e k ) = (1 - a k )e k + a fe G (0, 1) for k = 1, 2. 
If we define the sets 

ft fc := {t; G Wi x W 2 : A fc (v) > f k (e k )}, 
for = 1, 2, from the Markov's inequality it is clear that 

\n h \ < e k (e k )Mi n) M^ n) (16) 

with e k (e k ) := f £ , k for /c = 1,2. For the set Q := (72-i U 1Z 2 ) C we obtain the following 
cardinality bound by (fT6l) : 

2 

iQl^a-^efcC^))^"^- (17) 

k=l 

Let 

T := {wi G Wi : there are at least e 2 (e 2 )M^ n) W2 G W 2 with (w 1 ,w 2 ) G Q}. (18) 

Our goal now is to find a lower bound on the cardinality of T. To this end, note that for w\ G T 
there are at most M 2 message indices iu 2 G W 2 with (wi,w 2 ) G Q and for u>i ^ T there are 
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r(») 

message m< 

9 



at most e 2 (e 2 )M 2 message indices w 2 £ W2 with (101,102) € Q- Thus by (fT7l) 

2 



and therefore 



- Ee fc (e fe ))Mr ; Mf J < |Q| < Mnr\+e 2 (e 2 )M^\W l \n 

k=l 

\r\ > iz^iM^M M f >. (19) 

1 - e 2 (e 2 ) 

The first factor on the right hand side of (fT9~l) is positive due to our restriction to S\ £ (0, |) and 
e 2 £ (0, \). Indeed, it is easily seen that e\{e\) < \ iff j 1 ^- < ai and this last relation is true 
by our choice of a\ which is possible due to our restriction to E\ £ (0, \). Similarly, we have 
62(^2) < \ iff 3^z^ < a 2 which is satisfied since e 2 £ (0, |). 
Now we set 



N[ n) :-- 



ei(ei) - 2e 2 (£ 2 ) M (n)' 



e 2 (£2) 



and 



iV 2 (n) :-- 



e 2 (e 2 )M ( 2 



As in the proof of the direct part of the coding theorem we can construct a sequence of 
(N[ n \N^\n) -codes from the given sequence of codes but with the additional property that 
the new sequence has the maximum error probabilities bounded by fi{s\) and f 2 (e 2 ). The new 
sequence of codes achieves the rate pair [R^, R^*\. Thus, we can apply our Theorem 13.31 to 
conclude that for E\ £ (0, ^) and e 2 £ (0, |) [R^, R^*\ £ Cbo If we interchange the roles of 
Wi and W2 in definition of the set T in (TT8T) and at the same time swap the numbers 
and f 2 (e 2 ), we can conclude in a similar fashion that Cbc = CBc,av(^i, £2) for £1 £ (0, |) and 
£ 2 e(0,|). □ 

IV. Discussion 

The coding principles of the bidirectional broadcast are similar to the network coding approach 
where we would have implemented a bitwise XOR operation on the decoded messages at the 
relay node [7], [8]. But since network coding [9], [10] is originally a multi-terminal source 
coding problem, the achievable rates in the broadcast phase using the network coding approach 
are limited by the worst receiver. This means that with a network coding approach we can 
achieve 

%>, R m < min{/(X; Y 1 ), I(X; Y 2 )} 
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for some common input distribution p(x). The achievable rates depend on the common input 
distribution and both channel transfer distributions. For our coding scheme each achievable rate 
depends on the common input distribution and its own channel transfer distribution only. For 
each channel we can separately find the optimal input distribution which achieves the maximal 
achievable rate for this link (equal to the single link capacity), but the optimal input distribution 
for one channel needs not be optimal for the other channel!! 

Accordingly, we see that the network coding approach using XOR on the decoded messages 
at the relay is in general inferior, but it achieves the capacity of the bidirectional broadcast if 
and only if for the maximizing input distribution p*(x) = arg max^ max{/(X; Yi), I(X; Y 2 )} 
we have I(X; = I(X; Y 2 ). 

In the following we will discuss the bidirectional broadcast for a binary symmetric broadcast 
channel and the achievable rate region of two-phase bidirectional relaying protocol. 

A. Binary Symmetric Broadcast Channel 

For the binary symmetric broadcast channel, let p 1 and p 2 denote the probability that a relay 
input X £ {0, 1} is complemented at the output Yi £ {0, 1} and Y 2 £ {0, 1} of node 1 and 2 
respectively. From [24, Chapter 8.1.4] we know that a uniform input distribution maximizes the 
binary symmetric channel. Therefore, the broadcast capacity region for the binary symmetric 
channel is given by 

C BC = [0, 1 - H{p 2 )\ x [0, 1 - H( Pl )}, (20) 

which includes the region [0, 1 — max{if (pi), H(p 2 )}} x [0, 1 — max{i7(p 1 ), H(p 2 )}] achievable 
using XOR at the relay node according to [7]. 

B. Achievable Bidirectional Rate Region 

We will now look at the achievable bidirectional rate region where we use in each phase the 
optimal strategies. Thereby, we optimize the time-division between the MAC phase with mem- 
oryless multiple access channel p(y\xi, x 2 ) and BC phase with memoryless broadcast channel 



'it is curious that if we transfer the result to scalar Gaussian channels with a mean power constraint, obviously the Gaussian 
input distribution will maximize both links simultaneously. For the vector valued Gaussian channel this is no longer the case. 
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p(yi,y 2 \x). Of course, due to the a priori separation into two phases, this strategy need not be 
the optimal strategy for the bidirectional relay channel. 

Let Ri and R 2 denote the achievable rates for transmitting a messages w± from node 1 to 
node 2 and a message w 2 from node 2 to node 1 with the support of the relay node. In more 
detail, node 1 wants to transmit message Wi with rate nRi in n channel uses of the bidirectional 
relay channel to node 2. Simultaneously, node 2 wants to transmit message w 2 with rate nR 2 in 
n channel uses to node 1 . Then let hmac and n BC = n — timac denote the number of channel 
uses in the MAC phase and BC phase with the property — > a £ [0, 1] and ^ — > 1 - a 
when n — > 00, respectively. We call a the time-division factor between multiple access and 
broadcast phase. With a sufficient block length n (respectively n M Ac and ^bc) we can achieve 
a bidirectional transmission of messages w\ and w 2 with arbitrary small decoding error if rate 
pairs [Ry£, R^} £ Cmac and [Rjfi, R^} £ Cbc exist so that we have 

nRi < min{n MAC R^,n B cR^}, 

nR 2 < min{n M AC-R^,^BC-R|^}- 

Thus, the achievable rate region of the bidirectional relay channel using time-division is given 
by the set of all rate pairs [Rj, R 2 ] which are achievable with any time-division factor a £ [0, 1] 
as n — >• 00. We collect the previous consideration in the following proposition. 

Proposition 4.1: The achievable rate region IZ-qrc of the two-phase bidirectional relay channel 
is given by 

^brc = {[Ri, R2] e R 2 : Ri < wmiaRft, (1 - a)R&}, 
R 2 < mm{aR^, (1 — a)R^} with a £ (0, 1), 

[^ir'^2r] e ^mac ; an d [R^,R^\ £ C B c}- 
Since Cbc ls larger than the region of the broadcast phase achieved by applying interference 
cancellation [5], [6] and XOR on the decoded messages at the relay node [7], [8], [10], the 
achievable rate region 7^brc includes the region which can be achieved by interference cancel- 
lation and network coding approaches. 

Finally, we briefly look at an example with binary channels. In Figure [2] we depicted the 
capacity region Cmac and Cbc and the achievable rate region 7^brc with a symmetric binary 
erasure multiple access channel [24, Example 14.3.3] and a binary symmetric broadcast channel, 
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/ n R1 
1 

1/2 

1/2 1 R ^R ff2 1/4 1/2 R, 

Fig. 2. The left figure shows the capacity regions Cmac (dotted line) and Cbc (dashed line), the right figure shows the 
corresponding achievable rate region TZbkc (solid line). The dashed-dotted line exemplarily shows for one angle (f> the achievable 
rate pair (•) on the boundary of TZbhc with the optimal time-division between the two rate pairs ( x ) on the boundary of Cmac 
and Cbc- 

cf. equation (|2Q|) . The boundary of the achievable rate region can be obtained geometrically if 
one takes for any angle 4> G [0, 7r/2] half of the arithmetical mean between the boundary rate 
pairs of the capacity regions where we have tan0 = R^/ = R^*/R^*. 

V. Conclusion 

In this work we present the broadcast capacity region of the two-phase bidirectional relay 
channel. Thereby, each receiving node has perfect knowledge about the message intended for 
the other node. Furthermore, the proposed achievable rate region of the two-phase bidirectional 
relay channel is in general larger than the rate region which can be achieved by applying the 
network coding principle on the decoded data. The coding theorem and weak converse are easily 
extended to Gaussian channels with input power constraints. 

We have also shown the strong converse with respect to the maximum error criterion for the 
broadcast phase. This result implies then that the capacity region defined with respect to the 
average error probability remains constant for all error parameters [£1,62] £ (0, \) X (0, \) or 
[e u e 2 ] E (0,|) x (0,i). 
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